Truncation error and dynamics in very low latency phonetic recognition

نویسنده

  • Giampiero Salvi
چکیده

The truncation error for a two-pass decoder is analyzed in a problem of phonetic speech recognition for very demanding latency constraints (look-ahead length < 100ms) and for applications where successive refinements of the hypotheses are not allowed. This is done empirically in the framework of hybrid MLP/HMM models. The ability of recurrent MLPs, as a posteriori probability estimators, to model time variations is also considered, and its interaction with the dynamic modeling in the decoding phase is shown in the simulations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation

Highly accurate speech recognition with very low latency is a big challenge but also an important requirement for modern real-time speech recognition applications such as speech-to-speech translation. We attack this problem by proposing a highly effective and efficient streaming mode decoding scheme. A novel multi-layered feature streaming method is introduced to minimize truncation errors duri...

متن کامل

شبیه سازی و ارزیابی شاخص تکاپوی آب سد مخزنی شهید یعقوبی با استفاده از روش تحلیل پویایی سیستم

Water resources simulation is efficient tools to evaluate different options and decision in development conditions. Supply of water demand with high reliability need to exact and perfect planning. So, dam behavior recognition and it operation is from essentials of water resources systems management and future planning. In this study, the software VENSIM the method based on the dynamics of the s...

متن کامل

Fourier Model Reduction for Large-Scale Applications in Computational Fluid Dynamics

A new method, Fourier model reduction (FMR), for obtaining stable, accurate, low-order models of very large linear systems is presented. The technique draws on traditional control and dynamical system concepts and utilizes them in a way which is computationally very efficient. Discrete-time Fourier coefficients of the large system are calculated and used to construct a reduced-order model that ...

متن کامل

The Effect of English Vowel-Recognition Training on Beginner and Advanced Iranian ESL Learners

This study was an attempt to investigate the effect of vowel-recognition training on beginner and advanced Iranian ESL learners. A total of 36 adult Iranian ESL learners (18 advanced and 18 beginners) who were students of various majors at Memorial University (MUN) were recruited for the study. Advanced participants had the experience of living in Canada for at least three years while beginners...

متن کامل

Segment Boundaries in Low Latency Phonetic Recognition

This study analyses how the reduction of the look-ahead length of a two pass phonetic decoder influences the alignment of the segment boundaries. It is shown how the optimization of some tuning parameters, such as the insertion penalty, is dependent on the look-ahead length. It is also suggested that the insertion penalty be dynamically adjusted to some measure of similarity of the phonetic seg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003